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The current experimental status of the searches for the very rare decays B® —¥ /i+ 
and B° — » /i+ fi~ is discussed. These channels are highly sensitive to various extensions 
of the Standard Model, specially in the scalar and pscudoscalar sector. The recent, most 
sensitive measurements from the CDF, ATLAS, CMS and LHCb collaborations are dis- 
cussed and the combined upper exclusion limit on the branching fractions determined 
by the LHC experiments is shown to be 4.2 X 10~ 9 for B° — > and 0.8 X 10~ 9 for 

B° — » The implications of these tight bounds on a selected set of New Physics 

models is sketched. 
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1. Introduction 

Processes, which are highly suppressed in the Standard Model (SM), such as decays 
mediated by flavour changing neutral currents (FCNC) allow stringent tests of our 
current understanding of particle physics. These transitions are forbidden at tree 
level in the SM, as all electrically neutral particles (7, Z°, H° and gluons) have only 
diagonal couplings in the flavor space. FCNC processes are therefore only allowed 
through loop contributions and probe the underlying fundamental theory at the 
quantum level, where they are sensitive to masses much higher than that of the b- 
quark. Historically, many observations have first been indicated by FCNC processes, 
examples include the existence of the charm quark or the high top quark mass. 

Precise measurements of the branching fractions of the two FCNC decays B° — > 
/i+ fx~ and _B°— > fi + fi~ belong to the most promising modes for a possible discovery 
of a theory beyond the SM. These decays are strongly suppressed by loop and 
helicity factors, making the SM branching fraction small: 3.1 x 10 -9 for B® decays 
and 1 x 10~ 10 for B° decays, both known with a precision better than 10%. 

Enhancements of the decay rates of these decays are predicted in a variety of 
different New Physics models. It has been emphasized many times that this decay 
is very sensitive to the presence of supersymmetric particles I I I I I I I I [ For ex- 
ample, in the minimal supersymmetric Standard Model (MSSM), the enhancement 
is proportiona l 9 ! 10 ! 1 ^ to tan 6 /3, where tan /3 is the ratio of the vacuum expectation 
values of the two Higgs fields. For large values of tan/3, this search belongs to the 
most sensitive probes for physics beyond the SM which can be performed at collider 
experiments. Other models such as non minimal flavor violating or Littlest Higgs 
models as well as those with extra dimensions like Randall Sundrum models predict 
large effects independent of the value of tan ^I2|i3|i4|i5|i6|i7|i8|i£[ j n tne aDse nce of 
an observation, limits on B(B®— > are complementary to those provided by 

high px experiments. The interplay between both allows to optimally constrain the 
SUSY parameter space. 

Measuring the decay rates of these decays has been a major goal of particle 
physics experiments in the past decade. The limit on the decay rates was gradually 
improved by the CDF and DO experiments at the Tevatron and the CMS, ATLAS 
and LHCb experiments at the LHC. 

In this review, the prediction of the branching fraction of B® d — > is 
discussed in Sec. [2] The most sensitive measurements of the branching fractions of 
and B° —> are discussed in Sec.[3j including a combination of the 

measurements performed by the three LHC experiments. The review is closed by a 
brief discussion of the implications of these measurements on various extensions of 
the Standard Model of particle physics in Sec. [4] 

2. Theory expectation of the branching fractions 

In this section, the calculation of the branching fraction of £?° — > /i + /J~ and B° — > 
is first discussed in a model independent way, followed by the numerical SM 
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prediction. For simplicity of notation, the expressions are given solely for the decay 
B® — > [i + u~ . The corresponding expression for B° — > ^t + /i~ decays can be trivially 
obtained by exchanging the s with a d quark. 



2.1. Model independent discussion 

The branching fraction of B® — > can be expressed as low-energy effective 

Hamiltonian using the operator product expansion (OPE) which allows to separate 
the long distance contributions to the decay amplitude from the short distance con- 
tributions. The former are relegated to non-perturbative hadronic matrix elements 
whereas the latter are described by perturbatively calculable Wilson coefficients 
Cfc. A detailed discussion of the concept can be found in Ref. [20] The effective 
Hamiltonian for b— ¥ s£ + £~ transitions is given by 

A / ' 2 

v i 

where Gp is the Fermi constant, V t b and V ts are elements of the Cabibbo-Kobayashi- 
Maskawa (CKM) matrix and Cj are the Wilson coefficients. The four-fermion op- 
erators Oi which are relevant for b— > s£ + £~ decays are: 

O w = (s^P L b)(ir l5 l) , (2) 
O s = m b (sP R b)(ll), (3) 
Op = m b (sP R b){h 5 l) , (4) 

where Pl,r = ^(1 T 7s)- The corresponding operators 0\ are obtained from the 
Oi operators by replacing P^ with Pr. The notation used here follows the one of 
Ref. |2"T1 and l2"2l The branching ratio of B® — > fi + [i~ decays can then be written in 
a model independent way as 

B(B° S ^ fx+a-) = -JL^fl m l sTBs \ Vtb V t :\Jl ^ (5) 

f 4m?, . , l2 

x (I - -gH \C S ~ C' s \ + \(C P - C'p) + 2- 

{ m B s 1 

where fg a is the B® decay constant, mp, and Tp 3 are the B® mass and lifetime 
respectively. The contributions of the scalar (Cs, C' s ) and pseudoscalar [Cp, C' P ) 
operators enter in the branching fraction without suppression. As the correspond- 
ing Wilson coefficients are still largely unconstrained, there is significant room 
for contributions from New Physics models. In the SM, however, they are pre- 
dicted to be strictly vanishing. The contribution from electroweak penguin dia- 
grams, contained in the coefficients Cio and C' 1Q , is suppressed by a helicity factor 
(m^/m^o) 2 R3 4 x 10~ 4 . In the Standard Model, only Cio is non-zero and its value is 
given by the real coefficient Cf M . It is dominated by a Z° penguin loop (75%) and 
a box diagram (24%)p3. The two dominant SM Feynman diagrams contributing to 
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Fig. 1. The two dominating SM Feynman diagrams contributing to the decay B® — > /i+ fi . 



the B° s d — > decay are given in Fig. |l| The sensitivity of this decay to scalar 

and pseudo-scalar interactions enables stringent tests of New Physics models with 
an extended (pseudo-) scalar sector. 



2.2. Standard Model prediction 

The numerical value of the Standard Model prediction of the branching fraction of 
B®— > f.i + fJ.~, as given in Eq. can be calculated as 

B(B°-^ /i + /0 = 3.1 ±0.2 x 1CT 9 , (6) 

where the input parameters as discussed in Ref. [T7] are used. The dominant un- 
certainty comes from the lattice based calculation of the B® decay constant, which 
is used here from a recent lattice average^ as fg B = 227.7 ± 6.2 MeV. The lattice 
calculations made significant progress recently and thus made this very high preci- 
sion possibl j? 5 l 26 l 27 l Depending on the choice of the numerical values of the input 
parameters for both the lattice and the experimental quantities, different authors 
calculate slightly different branching fractiona 22 l 23 l 28 l 2 3, up to 3.6 x 10 -9 . A new lat- 
tice average by the FLAG-2 group, which represents all big lattice collaborations, is 
expected by the end of 2012, which will hopefully help to settle these discrepancies 
in the predicted numerical value of B(B°^ 

Alternatively to the evaluation of Eq. |5]) , the CKM dependencies and hadronic 
uncertainties can be eliminated by expressing the branching fraction as a function 
of the Si? — B® meson mass difference AM S . The dependence on the decay constant 
fs, is exchanged with the bag parameter Bb s , which can be calculated with smaller 
uncertainties. The branching fraction of B® — > /i + /i~ decays is then given ad 25 ! 30 ! 

B{B^ /X+/0 = 4.36 x 10- 10 x ^L^lAM s (7) 
= 3.2 ±0.2 x 10" 9 , (8) 



where Y 2 (v) and S(v) are the two master functions of CMFV models^ , evaluated 
in the SM with v = mf/M^. In Eq. the numerical value of B Bg = 1.33 ± 0.06 
from Ref. |2H and [33] is used. It is a remarkable success of the lattice calculations 
that both values determined in Eq. ([6| and Eq. ^ are in excellent agreement. 
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2.3. Comparison of the calculated BR with experiment 

The LHCb collaboration recently measured the width difference in the B® system 
in a time dependent analysis of B® — > J/ij>4> decays^ as Ar s = 0.116 ± 0.019ps _1 . 
If the calculated value of B(B®^ /i + p~) is compared to a measurement, the finite 
width difference has to be taken into account as the measured branching fraction is 
the time integrated one, 

1 r°° 

B(B° S ^ M + /O cxp ' (t> = g / ( r ( S °W /'VP, (9) 

whereas the the CP averaged branching fraction is calculated. To compare the 
measured value with a theory prediction, one of the two values has to be corrected, 
as pointed out recent lyUMH Here, a correction of the theoretical value (Eq. ^) is 
chosen which reads 

B(B° S ^ /i+/0™' {t) = x 0(fl s °-> M V~) CP = 3.4 ± 0.2 x lO" 9 , (10) 

1 - Us 

where the CP averaged branching ratio as given in Eq. ^ is taken and y B is given 
as 

Ar 

y. = -=*■ = 0.088 ±0.014. (11) 

ZL s 

To compare the SM prediction to a measured branching fraction or an exclusion 
limit, the value of the branching fraction of B®—> given in Eq. (10) has to be 

used. 



3. Experimental situation 

The search for B a s d — > has been performed both at the B-factories BaBar 

and Belle as well as at the experiments at the Tevatron and at the LHC. At the 
B-factories, running at the T(45) resonance, only B° decays are accessible whereas 
at the Tevatron and the LHC both B° and B® mesons are produced. 

A comparison of the the 66-production cross section at the different experiments 
is given in Tab. [l] The efficiency of the acceptance and px requirements is not cor- 
rected for, as the applied selections are typical for analyses at the given experiments. 
The production cross section at hadron colliders is, depending on collision energy 
and detector acceptance, between 6pb (CDF, DO) and 94 fib (LHCb). Note that 
the LHCb experiment is a single arm, forward spectrometer which integrates the 
peak of 6&-pairs produced in forward direction whereas the general purpose detec- 
tors ATLAS and CMS (GPD) have a limited acceptance in the forward or backward 
region but integrate over the complete remaining phase space. The effect of these 
acceptances is that the 66-production cross section is similar for the LHCb experi- 
ment and the GPD. The sensitivity of recent HEP experiments for the searches for 
B® d — > /-t + /i~ can be summarized as follows: 
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• Compared to the cross sections at the Tevatron and the LHC, the bb- 
production rate at e+e~ colliders at T(45) is more than three orders of 
magnitude smaller. Hence, the number of recorded 66-pairs is much lower 
and the searches for B® d — > /i + ~fi~ are with the existing datasets of 500 and 
1000 fb^ 1 not competitive. Therefore, the discussion in this review focuses 
on the hadron collider experiments. 

• The Tevatron collider experiments recorded a dataset corresponding to an 
integrated luminosity of about 10fb _1 in Run II between 2001 and 2011, 
with an estimated number of 6 x 10 10 66-pairs produced per experiment. 

• Since the LHC began collecting data in 2010, its experiments have taken 
over the lead of the field due to the very high 66-production cross section. 
At the general purpose detectors at the LHC, about 4 x 10 11 66-pairs have 
been produced in their dataset of 5fb _1 which has been collected at in- 
stantaneous luminosities up to 3.5 x 10 33 s _1 m -2 . The LHCb experiment, 
in contrast to the GPD, leveled its luminosity in 2011 to a constant value 
of 2 — 3 x 10 32 s _1 m~ 2 , giving a total dataset until the end of 2011 of 1 fb _1 
which contains 9 x 10 10 6&-pairs. 

Thanks to the very large number of 66-pairs produced and the clean experimental 
signature of the B® d — > decays, the hadron collider experiments have per- 

formed the most sensitive searches for the decays B° s d — > These are discussed 

in detail in the remainder of this section. 

3.1. Overview of the analyses and search strategies 

The search for the rare decays B® d — > has been performed at both Tevatron 

detectors. The analyses discussed here contain the published CDF analysis using 
7fb _1 of pp collision dataP^at y/s = 1.96 TeV as well as their unpublished update^ 
using the full dataset of 10fb _1 , collected until the shutdown of the Tevatron col- 
lider. The DO collaboration published a resultP^ with a dataset corresponding to 
an integrated luminosity of 6.1 fb -1 . The sensitivity of the DO detector is about a 
factor of two worse than the CDF sensitivity as the DO mass resolution is much 

Table 1. bb-production cross sections in the acceptance of recent HEP experiments. Data taken 
from Refs.|36]|37]|38][39] The cross section for ATLAS, CMS, CDF, LHCb is calculated from a 
_B+ cross section measurement with p-p > 5 GeV (CDF: px > 6 GeV, LHCb pt > GeV) using 
the LHCb measurement of the hadronization fractions^!. The number of produced 66-pairs 
estimated for the dataset recorded until the end of 2011 and does not include trigger and 
reconstruction efficiencies. 



experiment 


acceptance 


T(45) 


pp, 1.96 TeV 


pp, 7 TeV 


fob-pairs produced 


BaBar, Belle 


Itt 


1.05 nb 






~ 1 x 10 9 


CDF 


\v\ < i 




6.3 ±0.6 




~6x 10 10 


ATLAS, CMS 


\V\ < 2-2 






75 ± 17 ^b 


~4x 10 11 


LHCb 


2 < n < 6 






94 ± 8 ^b 


~9x 10 10 
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worse. For the decay B® — > fJ, + fi~, a mass resolution of 24MeV/c 2 is expected for 
the CDF experiment whereas 120MeV/c 2 is expected for the DO experiment. Due 
to its lower sensitivity, the DO result is not discussed further in this review. 

At the LHC, the most sensitive measurements of the branching ratios of 
B® d — > /i + fj,~ have been made by the LHCb collaboration, which published three 
measurement jffL 45 ! 46 ! using 37 pb -1 , 370pb _1 and lfb -1 of ^/s = 7 TeV pp-collision 
data, respectively. Both general purpose detectors at the LHC have performed 
searches for B° d — > n + fr , the ATLAS collaboration^ using 2.4 fb -1 of data and 
the CMS collaboration^!^ using 1 fb" 1 and 5 fb" 1 of ^/s = 7 TeV pp-collision data. 

The invariant mass resolution of the three LHC experiments, estimated for B® — > 
fj,^ ' \x~ signal candidates, is 

ATLAS : 60MeV/c 2 (\n\ < 1) - 110MeV/c 2 (|r?| > 1.5) , 
CMS : 37MeV/c 2 (\n\ ~ 0) - 77MeV/c 2 (\n\ > 1.8) , 
LHCb : 24.8 MeV/c 2 , 

which shows the invariant mass resolution of the LHCb forward spectrometer to be 
on average a factor two better than the CMS resolution and a factor 3-4 better than 
the ATLAS resolution. This, together with the better separation between signal and 
background in the boosted forward region and the cleaner environment compensates 
for the luminosity, which is a factor 5 lower in LHCb compared to the GPDs. 

The measurements performed by the CDF and LHCb collaborations use a similar 
strategy to search for B® d —} ii + fi~ decays. Triggered opposite sign dimuon can- 
didates are further selected to clean up the sample. To maximize the efficiency, a 
significant amount of background is allowed in the selection. The remaining events 
are classified in a two-dimensional plane spanned by the invariant mass and the 
signal likelihood formed from the two-prong decay signature. 

In contrast to this, the ATLAS and CMS collaborations use a tight selection 
which is cut based at CMS and based on a boosted decision tree in ATLAS. Both 
measurements are binned in rapidity, separating regions with different detector per- 
formance and therefore different signal to background ratios. 

The event selection and classification with a signal likelihood is discussed in 
Sec. |3.2| All experiments use a relative normalization to interpret the observed 
pattern of events as branching ratio, discussed in Sec. |3.3| and then use the mod- 
ified frequentist method (CL S ) to extract the limits on the branching fractions, as 
discussed in Sections |3.4| and |3.5| In Sec. |3.6| a combination of the searches for 
B® d — > [i + \jT of the three LHC experiments is discussed. 

3.2. Event selection and Signal Likelihood 

All experiments have highly efficient muon triggers, which are used both to select 
events for the signal and for the normalization channel (see, e.g., Refs. [501 or [5T1 for 
a detailed discussion). The dimuon triggers for the B® — > analysis obtained 
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a high priority in the physics programs of the GPDs, therefore, the high efficiency 
could be preserved during the high luminosity running. 

The fully reconstructed signal candidates can be identified by their clearly sep- 
arated secondary vertex, exploiting the long B® meson lifetime. The candidate mo- 
mentum vector will be aligned with the separation between primary and secondary 
vertex (pointing), and the two muon candidates will be isolated from other tracks 
in the event, due to the hard B fragmentation. 

Background events tend to be partially reconstructed and shorter lived than 
the signal. They also have a softer pt spectrum, lower degree of isolation and a 
less precise pointing. The dominant combinatorial background consists of sequen- 
tial semileptonic decays (b — > c[i~ — > fi + fj,~ X) and double semi-leptonic decays 
(b — > fi + X , b — > fJ,~X). The former mostly populate the lower mass sidebands 
whereas the latter cover the whole mass range. In the analyses, these combinatorial 
backgrounds are estimated by an extrapolation from the invariant mass sidebands. 

Peaking backgrounds, dominantly originating from misidentified B° s — > K + K~ , 
B° — > 7r + 7r~ and B° — > K + n~ decays, need to be evaluated separately. Here, 
the invariant mass line shape can be extracted from simulated samples of doubly 
misidentified B9-. — > h + h'~ events. The misidentification rates K — > /j, and 
tt^~ — > ^ are extracted in data from control channels such as D° — > K~n + . The 
contributions of exclusive decays such as £?+ — > J/^( / u + /i~)/x + z/ ([J or B® — > fi + fi~-f 
events is found to be negligible. 

3.2.1. Classification of events 

The CDF and LHCb measurements use a loose selection that allows significant 
amounts of background and then use a multivariate selection to discriminate sig- 
nal from background. In this review only the more sensitive analysis of the LHCb 
collaboration is discussed in detail. 



LHCb analysis of 1 fb 

The selected sample is first cleaned with a multivariate classifier based on six vari- 
ables that can be used equally for the signal and the normalization channels. This 
selection removes 80% of the residual background, while retaining 92% of the signal. 
Applying it improves the performance of the main multivariate selection described 
below. 

About 17000 dimuon candidates pass this selection while 11.6 B® — > and 
1.3 B° — > n + jJT candidates are expected, assuming SM rates. The selected dimuon 
candidates are classified in a binned two-dimensional space formed by the dimuon 
invariant mass and the output of another boosted decision tree (BDT) which com- 
bines nine variables to optimally exploit geometrical and kinematic information. 
The variables used to construct the BDT include the B candidate pointing, decay 
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time and px, the displacement of the muons from the primary vertex and the dis- 
tance between the muon pairs as well as a measure of the track and B candidate 
isolation. 

No data were used for the choice of the variables and the subsequent training of 
the BDT, to avoid biasing the results. Instead the BDT was trained using simulated 
samples, — > for signal and bb —> fi + n~X for background. It is defined such 

that for the signal it is approximately uniformly distributed between zero and one, 
while for the background it peaks at zero. Of the 17321 dimuon candidates after 
the selection^] only 95 are in the region of high signal likelihood (BDT> 0.5) which 
contains about 50% of the expected signal. 

The probability for a signal event to have a given BDT value is obtained from 
data using an inclusive B9s — > h + h'~ sample. Only events triggered independently 
of the signal candidates are considered. The fraction of B9-. — > h + h'~ signal can- 
didates in each bin of BDT is determined by fitting the h + h!~ invariant mass 
distribution. 

The chosen number and size of the bins are a compromise between maximizing 
the number of bins and the necessity to have enough B9^ —> h + h'~ events to 
calibrate the signal BDT and enough background in the mass sidebands to estimate 
the combinatorial background in the B® and B° mass regions. The BDT range is 
thus divided into eight bins and the invariant mass range into nine bins. 



CDF analysis 

The CDF collaboration uses a similar multivariate classifier (called v^) which is 
based on an artificial neural net combining 14 input variables. The sample is binned 
in 8 bins of and 5 bins of invariant mass. The signal shape of is taken from 
the simulation and cross checked with B~ — s- J/ipK~ events. 

The evaluation of the compatibility of the observed pattern of events with the 
background or signal hypotheses is then performed in these bins and will be de- 
scribed in detail in Sec. 13.41 and 13.51 

3.2.2. Cut based separation of signal and background 

The ATLAS and CMS measurements use a tight selection of the signal on which 
the background is cut away. The ATLAS selection is based on a boosted decision 
tree combining 14 variables. To ensure that the data are reproduced, the simulation 
is tuned with an iterative reweighting procedure. 

The CMS collaboration optimizes a cut based selection on MC signal and data 
sideband events. A multivariate analysis is in preparation which is expected to give 
an improvement of up to 20% relative to the cut based analysis. 

a The number of events quoted here are in a mass range between 4900MeV/c 2 and 6000MeV/c 2 . 



10 J. Albrecht 



3.3. Relative Normalization 

The branching fraction for the — > signal is measured by all analyses 

discussed here relative to a channel with a well known branching fraction as 

, t N B a ^ u + u - 

B = B noxm x e J^ x i^ x g ; (12 ) 

^sig Jd(s) ^'norrn 

where the branching fraction is determined by £> norm , the branching fraction of the 
normalization channel. It needs to be corrected by the relative efficiencies e n0 rmAsig) 
the probabilities that a b quark fragments into a B^ and into the b hadron involved 
for the chosen normalization mode / n0 rm//d(s) an d by the ratio of observed candi- 
dates in the signal and the normalization mode, N B o _ >IJi + IJi -/N noIxn . This relative 
normalization has many advantages over a absolute normalization: the luminosity 
and production cross section is not required to be known and the determination of 
relative efficiencies is more robust than the determination of absolute efficiencies. 

All experiments discussed here measure B(B9s — > /i + /i~) relative to the B~ —> 
J/ip(fj, + [i~)K~ channel, which has a similar muon identification and trigger, which 
minimizes systematic uncertainties. Its branching fraction is known to a precision 
of 3.59p^l. However, as it is a three body decay, the detector acceptance as well as 
the kinematic distributions of the tracks differ and need to be corrected for. 

The LHCb collaboration performs the only measurement that uses the weighted 
average of three normalization channels, additionally B® — > J/ip(fi + fi~)cj)(K + K~) 
and B9s — > h + h'~ . These three channels have different advantages as normalization: 
the _B" — > J I ipej) decay is a B® decay and hence the fraction of hadronization proba- 
bilities cancels. However, the branching fraction of this decay is currently known to 
a precision of only 26% ^2. The B9*. —> h + h'~ channel has an identical two body sig- 
nature as the signal decay, but has been selected by very different hadronic trigger 
than the dimuon signal. Therefore, only Z?9 ^ h + h'~ candidates that are unbiased 
by the trigger selections are used for the normalization. While the normalization of 
the average of three channels significantly improves the robustness of the analysis 
and reduces the dependence on the simulation, the B + channel strongly dominates 
the average. 

The mass distributions of the selected B~ — > J/ipK~ candidates are shown in 
Fig. [5] for the four experiment discussed here. Please note the different scale of the 
figures, while the resolution of the LHCb and CDF experiments is good enough to 
clearly separate the shoulder of partially reconstructed B® — > J/ipK*° events from 
the signal, the ATLAS and CMS experiments need to model it. 

The relative efficiencies arc determined by all experiments from the simulation 
and have been verified with B® — > J/^p(p, B^ — > J/iJjK* and B^ — > h + h'~ events in 
data. The uncertainties the efficiency ratio contributes to the normalization factors 
vary between 3% (LHCb) and 16% (ATLAS end-cap). 

For the normalization to B + candidates, the ratio of 6-hadronization fractions 
need to be included in Eq. [12] The CDF experiment uses the PDG average for the 
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Fig. 2. Invariant mass distribution of the B — > J/4>K candidates used in the normalization 
procedure. Note the different scale on the Figures. Reproduced from CDpEU, ATLA^S, CM^Sl 
and LHClJ^l. 



hadronization fractional, which has an uncertainty of 10%. The LHC experiments 
all use the measurement performed by the LHCb collaboration'' ', which has an 
uncertainty of 7.8%. The ATLAS and CMS collaborations use this measurement 
assuming it is valid for all momentum and rapidity regions. 



3.4. Extraction of the branching ratio 

The branching fraction limit is extracted from the pattern of events using the 
modified frequentist methocEEH (CL S ), where the background only (B) and sig- 
nal plus background (S+B) hypotheses are tested. The logarithmic form of the 
likelihood ratio 

_ 21nQ - -21n^ - -21nTT + (13) 

ZYnQ- 2m ^ - Iln^ e - bi ^ / d .i > ^ 

depends on the expected number of signal (s^ and background (bi) events in each 
bin i, as well as on the observed number of events (di). The method provides a 
measure of the compatibility of the observed distribution with the signal plus back- 
ground hypothesis, CL s+ b, as well as a measure for the compatibility with the back- 
ground only hypothesis, CL b , which are both derived from pseudo experiments. The 
fraction of signal plus background (background only) events with a value of —2 In Q 
smaller than or equal to the observed valve is computed. Systematic uncertainties 
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are included in the pseudo experiments as nuisance parameters using Poisson dis- 
tributions for the uncertainties with statistical nature and bifurcated Gaussians for 
the others. 

The ratio of both confidence levels, CL S = CL s+ b/CLb, is used to set an exclu- 
sion limit on the branching fraction of the two decays to protect against negative 
statistical fluctuations of the background which could lead to an exclusion of the 
null hypothesis without experimental sensitivity. To claim evidence of the decays, 
1 — CLb is used as a p- value. A 95% confidence level exclusion corresponds to a CL S 
value of 0.05 and a 3a evidence to a value of 1 — CLb = 2.7 x 10~ 3 for a one sided 
definition of the significance. 

3.5. Results 

The CDF collaboration observes an excess of candidates in the -B 9 — > signal 
region. It is concentrated in bins with a high signal likelihood, vjy > 0.97. There is 
also an excess in the medium likelihood bins, 0.97 < < 0.987, which appears to 
be a statistical fluctuation of the background as there is no significant expectation 
of B° s —> signal consistent with the observation in the two highest bins. The 

p-value for a background-only hypothesis is 0.94%, when considering only the two 
highest vn bins the p-value becomes 2.1%. The branching fraction corresponding 
to the observed pattern of events is determined from a log-likelihood fit and is 
determined to be 

B(B° S ^ a* + aO CDF = 13t 9 7 x 10- 9 . (14) 

Additionally, bounds are set on the branching fraction using the CL S method as 
B(B° s ^r fi+fj,-) < 31 x 10" 9 at 95% CL. 

The excess seen by the CDF collaboration has not been confirmed by the LHC 
experiments, which are summarized in Tab. [2j The ATLAS collaboration analyses 
2.4fb _1 of data and observes no excess of candidates and determines an upper 
limit of B(B® —> < 22 x 10 -9 , being very close to the value expected from 

background extrapolation. 

The CMS collaboration analyses 4.9fb _1 of data and observes the pattern of 
events shown in Fig. [3] (top). Six events are observed in the B® —> /i + fi~ signal 
region, of which two are in the barrel and four in the end-cap. This pattern of events 
is consistent with the expectation from the Standard Model branching fraction plus 
background. The upper limits combined for both regions are at 95% CL: 

B(B°^ /i+ M -) CMS < 7.7 x 10~ 9 and (15) 
S(_B°— > £i + /i~) CMS < 1.8 x 10~ 9 , (16) 

which is well comparable to the median expected upper limits of 8.4 x 10~ 9 for the 
B® mode and 1.6 x 10~ 9 for B° mode. 

The LHCb collaboration analyses 1 fb _1 of data and observes the distribution 
of the invariant mass for BDT> 0.5 as shown in Fig. [3] (bottom) for f? 9 and B° 
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Table 2. Expected and observed upper limit on B(B® d — > ff^fj,~) for the most recent measurements, 
all given at 95% CL. For the expected limits, it is indicated if the SM signal plus background or the 
background only case is assumed. The CDF collaboration observes an excess of £?J — ► /z+ fj,~ candidates 
and quotes bounds at 95% CL on the branching fraction: 0.8 X 10~ 9 < B(B°-t M + aO < 34 x 10~ 9 . 



experiment 


luminosity 


expected 


observed 

xlO" 


B ^ 
expected 

-9 


observed 


DO 


61b" 1 


40 ( bk s) 


51 






CDF 


lOfb" 1 


13 ( bk g) 


31 


40 ( bk s) 


46 


ATLAS 


2.4fb~ 1 


23 ( bk s) 


22 






CMS 


4.9fb~ 1 


8 4 (SM+bkg) 


7.7 


1.6 ( bkg ' 


1.8 


LHCb 


lfb" 1 


y 2 (SM+bkg) 


4.5 


1.1 (bkg) 


1.0 



decays. The expected limits are computed allowing the presence of — > /i + /i~ 
events according to the SM rate, including cross feed between the two modes. A 
limit of 7.2(1.1) x 10 -9 is expected for the B® and B° modes respectively. The 
analysis of the observed pattern of events gives 

B(B° S ^ M V-) LHCb < 4.5 x 1(T 9 and (17) 
B(B ^ M V~) LHCb < 1-0 x 1(T 9 . (18) 

This is a downward fluctuation of la for the B® mode with respect to the SM expec- 
tation. A simultaneous unbinned likelihood fit to the mass projections in the eight 
BDT bins has been performed to determine the B® — > branching fraction. 

The fit gives 

B{B° S ^ ^+ M -) LHCb = 0.8t\i x 1(T 9 , (19) 

where the central value is extracted from the maximum of the logarithm of the profile 
likelihood and the uncertainty contains both statistical fluctuations and systematic 
uncertainties. 

In comparison to the CMS measurement, LHCb has a 15% higher sensitivity 
which shows that the better detector resolution (both in mass and vertex precision) 
and the more advanced analysis overbalance the factor five higher luminosity that 
the CMS experiment has recorded. Due to a downward fluctuation of the data, the 
measured upper exclusion limit on the decay B® — > is about 40% below the 

CMS limit. 

3.6. Combination of B° d — >• /x+ [i~ measurements 

The results of the analyses performed on the 2011 LHC dataset of the ATLAS, CMS 
and LHCb collaborations have been combined 56 . As the LHCb collaboration has 
also published a measurement using the data collected in 2010, the measurements 
performed on both datasets are used in the combination. 
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Fig. 3. Distribution of selected signal candidates. Top: Events observed in CMS in the barrel (left) 
and end-cap (right) channels. Bottom: Events observed in LHCb in the B® channel (left) and the 
B° channel (right) for BDT> 0.5 and expectation for, from top, SM signal (gray), combinatorial 
background (light gray), -B^) — ► h + h'~ background (black) and cross feed between both modes 
(dark gray). The hatched area depicts the uncertainty on the total background expectation. Figures 
reproduced from CM^SO and LHCtM 



The combination uses the modified frequentist approach, as described in Sec. 3.4 



to combine the individual measurements. The expected and observed CL S values 
of the three experiments are shown in Fig. [4] (left) as a function of the assumed 
branching ratio of B° — > A exclusion limit of B(B®^ /i + /i~) < 4.2 x 10~ 9 is 

determined. The compatibility of the determined branching ratio with the Standard 
Model value (see Sec. 2.3 1 is evaluated as 1 — CL S+D = 0.84. The combination 
of the three LHC experiments shows a moderate excess of the branching ratio of 
B® —> /i + yT over the background only hypothesis of 1 — CL D = 0.05. The combined 
CL S curve for the B° — » n + [iT decay is shown in Pig. [1] (right). The observed upper 
limit on the branching fraction is found to be 8.1 x 10 -10 at 95% CL. 

The combined upper limits on B® — > and B° — > are summarized 

in Tab. [3j they improve the limits obtained by the individual experiments and 
represent the best existing limits on these decays. The combination improves the 
expected limit with respect to the LHCb analysis alone by 15% assuming a signal 
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B(B°^n*n-)x10' B B(B°->n*n')x10- 1 ° 

Fig. 4. CL S as a function of the assumed B for B® — ¥ fi + fJ.~ (left) and B° — > fi + fJ.~ (right). The 
dashed black curves are the medians of the expected CL S distributions, if background and SM 
signal were observed (for B®—> [i + fi~) and in absence of signal (fi°— > fi + fjt~). The green (yellow) 
areas cover the ±1(2)<t intervals. The solid blue curves are the observed CL S . The upper limits 
at 90 % (95 %) C.L. are indicated by the dotted (solid) horizontal lines. Figure reproduced from 
Ref.l56l 



Table 3. Expected and observed upper limit on B(B® , — > /i + ^t~) for the combination of 
the LHC analyses, all given at 95% CL. The expected limits correspond to the median cases 
in which only background or SM signal and background events were observed. 







Expected 


Observed 






bkg only SM + bkg 




B(B°- 


* At+/i _ ) xicr 9 


2.3 6.1 


4.2 


B(B°- 


» n+i-i ) xicr 10 


7.3 


8.1 



at the SM rate any by 32% assuming the absence of any signal. Despite this largely 
improved sensitivity, the effect in the observed upper limit is reduced as the LHCb 
experiment observes a downward fluctuation of signal events with respect to the 
SM prediction. 

4. Implications 

The exclusion limit on the branching fraction of the decay B® —> on ly 20% 

above the SM expectation provides stringent tests of possible extensions of the 
Standard Model. Especially the scalar sector, as discussed in Sec. [2j is strongly 
constrained by the exclusion of large enhancements in the branching fraction. 

The implications of the existing measurements are discussed in the context of 
two simplified variants of the minimal supersymmetric extension of the Standard 
Model (MSSMPEI: the constrained MSSM (CMSSM) 59 and non-universal Higgs 
Masses of type 1 (NUHMlj^l. In the CMSSM, the number of free parameters is 
reduced to five: tioj to i/2 and A , denoting common scalar, fermionic and trilinear 
soft supersymmetry-breaking parameters at the GUT scale, and tan/3, sgn/j, denot- 
ing the ratio of the vacuum expectation values of the two Higgs fields and the sign 
of the mass term respectively. In the NUHM1 model, the universality condition for 
the Higgs bosons are decoupled from the other scalars, adding two extra parameters 
to the model. 
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CMSSM - tan |i=50, A =0 CMSSM - tan p=30, A =0 
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Fig. 5. Constraints from flavor observables in CMSSM in the plane (mi/2; m o) with Ao = 0, for 
tan/3 = 50 in the left and 30 in the right. The black line corresponds to the CMS exclusion limit 
with 1.1 fb of data and the red line to the CMS exclusion limit with 4.4 fb of data. Figures 
reproduced from Ref. 1611 



A analysis of the constrains imposed by the different flavor observables in the 
CMSSM model is done, for example, in Refs. HHI or IHU Fig. [5] shows the plane 
(m 1 / 2 ,TOo) for large (left) and moderate (right) values of tan/? in the CMSSM. 
Direct search limits from CMS are superimposed for comparison. It can be seen 
that, at large values of tan f3, the constraints from — > are stronger than 

those from direct searches. At smaller values of tan /3, the flavor observables start 
to lose importance compared to direct searches. In this regime the other flavor 
observables, in particular the observables measured in B° —> K*fi + n~ decays, loose 
less sensitivity and hence play a complementary role. In more general MSSM models, 
the parameter space is significantly less constrained. 

The influence of the searches for B® d —> n + fi~ on various supersymmetric mod- 
els can be seen in a global analysis of the constraints on these models from the 
measurements in the high and in the flavor sectoi^l. A global fit to all input 
excluding B(B® — > /i + /i~) predicts in the CMSSM an enhancement up to a factor 
of two and in NUHM1 up to factor of three. The current limits on B® — > 
exclude the minimum of the global fits in both the CMSSM and NUHM1 case. 

Another aspect is the interplay between searches for B° s d — > and Higgs 

physics, since any viable model point has to be in agreement with all direct and 
indirect measurements. For example, the Higgs boson recently discovere d 63 ! I at a 
mass of about 125 GeV/c 2 rules out MSSM scenarios in which the signal corresponds 
to the heaviest CP-even Higgs (as opposed to the lightest Higgs), as this would 
imply a too light pseudo-scalar Higgs to be consistent with the existing constraints 
on B(B®^ /i + /i~). See Ref. l65l for a more detailed discussion. 

It is clear that with more precise measurements on the branching fraction of 
B® d — > large parts of the supersymmetric parameter space could be dis- 

favoured, in particular the region of high values of tan f3. Because of the difficulty to 
access this range in direct searches, the measurements of B® d — » are a crucial 

element in the exploration or exclusion of supersymmetry. Also, a measurement of 
BR(£?1? — > n + fi~) lower than the SM prediction would rule out a large variety of 
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supersymmetric models. 

Once an evidence for the decay B® — > ^t + /i~ is found, the ratio of the branching 
fraction of the B® — > fi + fi~ and the B° —> decays becomes accessible. This 

ratio is essentially free of hadronic uncertainties and is predicted to be^ 

B(B° S ^ fSjjr) B Bd r Bs AM S mfv 
B(B^^fi-) B Bs r Bd AM d ~ ' 

where the lifetime t Bs d and mixing frequency A¥ S) j are experimental input and 
ratio of the bag parameters is well known, B Bd /B Bg = 0.95 ± 0.06. A measure- 
ment of this ratio provides a stringent test of the minimal flavor violation (MFV) 
hypothesijP. 

The correlation between both decays also allows to discriminate between classes 
of new physics models, as shown in Fig. [6j A large part of the parameter space 
of the supersymmetric models, where tan/3 can be large, is ruled out by the con- 
straints. However, in models where NP enters via the semi-leptonic operators 0±q, 
such as the Standard Model with a sequential fourth generation (SM4) or Randall- 
Sundrum models (RSc) are starting to be probed only now. See Ref. l67l for a detailed 
discussion of all models shown in Fig. [6j 




10 9 x BR(B, -» n + iT) 

Fig. 6. Correlation between the branching fractions of the — > fi + and B° — > decays 
in MFV, the SM4 and four SUSY flavor models. The SM point is marked by a star and the grey 
area is excluded by the LHCb measurement given in Eq. fXTD and \18) . Figure reproduced from 
Ref.l68l 



5. Summary 

The experimental situation of the searches for the very rare decays B® — > and 
B Q — > has been reviewed. The data contains a moderate excess of B® — > 

signal candidates with a p-value of the background only hypothesis of 0.05. The 
current most precise upper exclusion limits from a single experiment are measured 
by the LHCb collaboration as B{B® — > < 4.5 x 10~ 9 , the combination of 

the three LHC experiments is even more stringent at 4.2 x 10~ 9 . This limit is only 
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20% above the SM prediction, which puts tight constraints on various extensions of 
the Standard Model, especially on supersymmetric models at high values of tan j3. 
The decay B — > is constrained by the existing measurements to an upper 

exclusion limit of 8 x 10~ 10 . 

The tight experimental bounds on B(B® — » /i + /i~) precludes the optimistic 
predictions by various supersymmetric models of enhancements of several orders of 
magnitude. However, a possible exclusion of the Standard Model rate would still 
be a clear indication of New Physics. It needs also to be noted that the bounds 
on B(B° — > are still one order of magnitude above the predicted SM value, 

which means that the most promising channel to search for New Physics could be 
the B° and not the B® decay. 

The CMS and LHCb collaborations have excellent prospects to observe the decay 
Bj-^ with the dataset collected in 2012. This observation, and the precision 

measurement of B(B®^- /i + /i~) in the coming years will allow to strongly constrain 
the scalar sector of any extension of the Standard Model. The next step will be to 
limit and later measure the ratio of the decay rates of B° —> /B° —> /i + yT , 

which allows a stringent test of the hypothesis of minimal flavor violation and a 
good discrimination between various extensions of the Standard Model. 
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